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ABSTRACT 

Context. Galactic archaeology aims to determine the evolution of the Galaxy from the chemical and kinematical properties of its 
individual stars. This requires the analysis of data from large spectroscopic surveys, with sample sizes in tens of thousands at present, 
with millions of stars being reached in the near future. Such large samples require automated analysis techniques and classification 
algorithms to obtain robust estimates of the stellar parameter values. Several on-going and planned spectroscopic surveys have selected 
their wavelength region to contain the IR Can triplet (~ AA 8500 A) and the work presented in this paper focuses on the automatic 
analysis of such spectra. 

Aims. We aim to develop and test an automatic method by which one can obtain estimates of values of the stellar atmospheric 
parameters (effective temperature, surface gravity, overall metallicity) from a stellar spectrum. We also explore the degeneracies in 
parameter space, estimate the uncertainties in the derived parameter values and investigate the consequences of these limitations for 
achieving the goals of galactic archaeology. 

Methods. We investigated two algorithms, both of which compare the observed spectrum to a grid of synthetic spectra, but each uses a 
different mathematical approach for finding the optimum match and hence the best values of the stellar parameters. Our investigation 
of these algorithms' robustness can be widely applied because it amplifies the main problems that the other methods can encounter. 
The first algorithm, MATISSE, derives the values of each stellar parameter through a local fit to the spectrum such that each pixel 
in wavelength space is treated separately. The sensitivity of the flux at each wavelength to the value of a given stellar parameter is 
determined from the synthetic spectra. The observed spectrum is then projected using these sensitivity vectors to give an estimated 
value of the stellar parameters. This value depends on finding the true minimum in the fit and the algorithm must avoid being trapped 
in false local minima. The second algorithm, DEGAS, uses a pattern-recognition approach and consequently has a more global vision 
of the parameter space. The best-fit synthetic spectrum is derived through a series of comparisons between the observed and synthetic 
spectra, summed over wavelength pixels, with additional refinements in the set of synthetic spectra after each stage, i.e. a decision 
tree. 

Results. We identified physical degeneracies in different regions of the H-R diagram: hot dwarf and giant stars share the same spectral 
signatures. Furthermore, it is very difficult to determine an accurate value for the surface gravity of cooler dwarfs. These effects are 
intensified when the lack of information increases, which happens for low-metallicity stars or spectra with low signal-to-noise ratios 
(SNRs). Our results demonstrate that the local projection method is preferred for spectra with high SNR, whereas the decision-tree 
method is preferred for spectra of lower SNR. We therefore propose a hybrid approach, combining these methods, and demonstrate 
that sufficiently accurate results for the purposes of galactic archaeology studies are retrieved down to SNR~20 for typical parameter 
values of stars belonging to the local thin or thick disc, and for SNR down to ~50 for the more metal-poor giant stars of the halo. 
Conclusions. If unappreciated, degeneracies in stellar parameters can introduce biases and systematic errors in derived quantities for 
target stars such as distances and full space motions. These can be minimised using the knowledge gained by thorough testing of 
the proposed stellar classification algorithm, which in turn lead to robust automated methods for the coming extensive spectroscopic 
surveys of stars in the Local Group. 

Key words. Stars: fundamental parameters - Stars: abundances - Techniques: spectroscopic - Methods: data analysis 



1. Introduction 

Understanding the formation and evolution of the Milky Way 
Galaxy from the properties of its long-lived constituent stars 
(also known as the field of galactic archaeology) requires the col- 
lection of photometric and/or spectroscopic data for statistically 
significant samples of stars throughout the Galaxy. Photometry 
has the advantage of faster completion of deep wide-field/all- 
sky surveys but spectroscopy provides more accurate, detailed 
information about the target stars. For example, depending on 



the spectral resolution, one can more easily determine overall 
metallicities ([M/H]), the enhancements of a-elements with re- 
spect to iron (with respect to the Sun, [a/Fe]), individual elemen- 
tal abundances and the fundamental stellar parameters effective 
temperature (7W) and surface gravity (log g). The combination 
of these parameters allows one to derive line-of-sight distances 
through a comparison of a given star's position on the H-R di- 
agram and an appropriate set of theoretical isochrones. Full 6D 
phase-space coordinates can be determined if proper motions are 
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available, which additionally constrain models of Galaxy forma- 
tion and evolution. 

The wavelength range around the IR Ca n triplet is more and 
more frequently used in studies of galactic archaeology. The IR 
Can triplet (/U8498.02, 8542.09, 8662.14 A) lines are strong for 
most stellar spectral types and luminosity classes, as well as 
for very metal-poor s tars (see, for example IZwitter et al.ll2004t 
IWilkinson et al.ll2005l) . providing ideal features for a robust de- 
termination of the star's radial velocity. The usefulness of this 
wavelength range goes beyond line-of-sight kinematics, because 
there are numerous absorption lines from many species, includ- 
ing iron and several a-elements (Ca n, Si i, Mg i) and these re- 
main visible even at relatively low spectral resolution and low 
signal-to-noise ratio (SNRQ). The pattern of the ratios of these 
c-elements to iron can be used to trace the star formation time- 
scale in the parent system owing to distinctive roles played by 
the supernovae of different type (and explosion timescale) in 
creating and ejecting the different elements. In addition, an es- 
timate of overall metallicity can be derived from empirical cali- 
brations between metall icity (fM/Hl) and the equivalent widths 
of the Ca n triplet dBattaglia et al.ll2008t IStarkenburg et al.ll2010t 
Fulbright et al.ll2010n . Furthermore. Paschen lines (for example 
^8502.5, 8545.4, 8598.4, 8665.0, 8750.5 A) are visible for stars 
hotter than spectral type G3. The Mgi (/ U8807 A) line, which 
is a useful indicator of surface gravity (see lRuck & Sm ith 1993; 
Battaglia et alj201 ll) . is also visible, even in spectra of low SNR. 
Finally, lines from molecules such as TiO and CN can be seen in 
spectra of the cooler stars. 

The collection of very large samples of spectroscopic data to 
undertake studies in galactic archaeology has become feasible 
in recent years. For example, multifiber instruments such as 
the GIRAFFE/FLAMES spectrograph, mounted on Kueyen, 
the second-unit telescope of the Very Large Telescope (VLT) 
of the European Southern Observatory (ESO) allows spectra 
of more than a hundred objects to be obtained at a time with 
adequate SNR (typically higher than ~20) after only a few 
hours of exposure time. The first galactic archaeology survey 
to use the IR Ca n triplet region has been the RAdial Velocity 
Expe riment survey (RAVE, ,L18410 - 8795 A, see ISteinmetzl 
2003). This targets bright (limit of / ~ 13) stars and uses the 6dF 
multi-object spectrograph on the UKSchmidt telescope with a 
resolution of R= A/AA ~ 7500; the RAVE project has already 
collected more than four hundred thousand spectra. In addition, 
several studies of the stellar populations of the Milky Way 
and its satellite dwarf galaxies have been conducted with the 
FLAMES multifiber spectrograph of ESO, using instrumental 
setups centred on the IR Can triplet (specifically, the LR8 and 
HR21 setups). The s eries of papers related to the ESQ large 
progr ammes DART dTolstov et alJ 120061: iBattaglia et al. 2006, 



2010) and that of Gilmore et al. 171.B-0520(A) (see lWvse etail 



2006; iKoch et alJ I2007L 120081 and references therein) are 



good examples of the use of the IR Can triplet for galactic 
archaeology. Furthermore, in the near future the ESA Gaia 
mission will collect several tens of m illions of spectra wi th its 
Radial Velocity Spectrometer (RVS, IWilkinson et alJl2005l) . at 
a spectral resolution R~ 11500 and two different samplings - 
0.02453 nm and 0.07359 nm (effective resolution of -7000) - 
depending on the brightness of the targets. 

Despite this extensive use of the IR Ca n triplet region and 
its previously mentioned advantages, the determination of the 



1 Throughout the paper, we define the SNR as being the signal-to- 
noise ratio per pixel. 



values of stellar atmospheric parameters from low- to medium- 
resolution spectra in this wavelength domain is far from trivial. 
In particular, as we will see below, there exists a strong degen- 
eracy between the effective temperature and the surface gravity, 
in that varying either produces similar changes in the normalised 
stellar flux of several spectral features. This degeneracy can lead 
to the misclassification of dwarf stars as giant stars, and vice 
versa, a problem that becomes more severe for low-metallicity 
stars. This misclassification obviously creates errors in the de- 
rived stellar distances based on the estimated values of the atmo- 
spheric parameters. The ability to mitigate this degeneracy must 
be built-in to the automated methods of stellar parameter deter- 
mination and that is the only feasible way of dealing with the 
massive datasets produced by the coming galactic archaeology 
surveys. 

The goal of this paper is to show which astrophysical in- 
formation can be retrieved from spectra observed around the 
calcium triplet. We investigate the performances of two meth- 
ods of automated estimation of the values of stellar parameters 
based on very different mathematical approaches, which allows 
us to describe the main problems encountered in this spectral re- 
gion. As a specific example of the application of these two algo- 
rithms, we consider the case of spectra obtained with the LR8 
setup of FLAMES (8206-9400 A, R-6500, sampling=0.2A). 
The spectra obtained using the other instruments and/or setups 
mentioned above (RAVE - RVS - FLAMES/HR21) have either 
broader wavelength ranges or higher spectral resolutions, and 
each of these provides more information that can help to dis- 
entangle some of the degeneracies. Consideration of the degen- 
eracies resulting from spectra obtained with the LR8 setup will 
therefore cover those likely to result from the other setups. 

The structure of this paper is as follows: in Sect. [2] we in- 
troduce the two methods whose performances we test, and in 
Sect.|3]we explain how the synthetic spectra used as training and 
testing sets have been computed. In Sect.|4]we present the perfor- 
mances that are achieved by each of the two applied algorithms 
and present the final adopted strategy for the derivation of the 
values of the stellar atmospheric parameters. Section is ded- 
icated to the application of the method to real spectra, dealing 
with errors in the derived parameter values that are introduced 
by uncertainties in the radial velocity and in the continuum nor- 
malisation. Finally, in Sect. Qwe discuss the effects of the errors 
in the parameter determinations on the science goals of galactic 
archaeology. 



2. Automatic parameter estimation methods 

The parameter estimation problem consists in finding the stellar 
atmospheric parameters (mainly effective temperature, surface 
gravity, global metallicity and individual chemical abundances) 
that define a synthetic spectrum that is an optimal fit to an ob- 
served spectrum. This estimation cannot be made analytically, 
because of the complexity of the theoretical spectra, which in- 
volve the very complex physics included in stellar model at- 
mosphere and spectral line formation theory. As a consequence, 
the stellar spectra parameterisation has to be performed by using 
synthetic spectra grids which span the parameter space that we 
are concerned with here. 

We point out that we favour in our study parameter estima- 
tions from synthetic spectra grids rather than grids of real ob- 
served stars, mainly because a sufficiently accurate parameteri- 
sation of large samples of different types of stars does not exist. 
Therefore, the isolation of flux variations caused by variations 
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of only one of the atmospheric parameters cannot be easily per- 
formed, which is different from synthetic spectra. 

The main problems encountered by the different automated 
estimation methods are caused by the possible non-linearity of 
the model spectra and the non-convexity of the distance function, 
which quantifies the difference between a model and the data. On 
one hand, a change in the atmospheric parameters can induce 
non-linear variations of the spectral flux, which is more impor- 
tant for large-scale parameter variations: two sets of parameters, 
distant in parameter space, can result in very similar spectra. On 
the other hand, the flux changes caused by the variation of a 
given parameter, for instance, the effective temperature, can be 
very similar to those induced by another parameter, usually the 
surface gravity or the metallicity. Therefore, secondary minima 
and multiple solutions to the stellar spectra fitting problem may 
exist . Parameter degeneracy is usually more severe when the 
available information about the parameters decreases: e.g., with 
a more narrow spectral range, lower spectral resolution, lack of 
spectral signatures, and so on. In addition, secondary minima 
can also be artificially generated by noise disturbing the distance 
function. 



particular vector Bg(A) associated to a theoretical #, parameter, 
as follows: 



As presented in Biia oui et alj {2010), several methods exist 
in the literature for the automatic parameter estimation from stel- 
lar spectra. The methods can be separated into i) solving an opti- 
misation problem (e.g. minimum distances, Nelder-Mead algo- 
rithm, Gauss-Newton algorithm), ii) optimising a projection on 
given vectors (e.g. principal component analysis, MATISSE al- 
gorithm, penalized x 2 ), or Hi) as a classification problem (e.g. 
artificial neural networks, support vector machines, decision 
trees). 

Optimisation and projection algorithms usually rely either 
on the convexity of a distance function or the linearity of the 
models. Nevertheless, a local parameter space treatment is some- 
times considered to minimise the failure of these conditions. On 
the other hand, classification algorithms tackle the parameteri- 
sation problem from a completely different point of view: the 
pattern recognition one. This different approach can have many 
advantages in severe conditions of parameter degeneracy. 

As the work of this paper shows, each method has its optimal 
application field, and a combination of algorithms can be neces- 
sary for achieving the best results. We will illustrate the diffi- 
culties that we encountered when performing the stellar param- 
eterisation of IR Ca ii triplet region low -resolution spectra. To 
do this, we will apply two differe nt parameterisation m ethods: a 
projection algorithm, MATISSE ( Reci o-Blanco et al.|[2006b and 
a classification method, DEGAS (Biia oui et alJ 20101) . based on 
an oblique k-d decision tree. Those two algorithms, which are 
representative of two very different mathematical approaches, 
exemplify the main problems that the other methods can en- 
counter when dealing with this kind of spectra. 



2.1. The MATISSE method 

The MATISSE algorithm (MATrix Inversion for Spectral 
SynthEsis) is a local multi-linear regression method. We briefly 
recall now the basic equations of the method, but we encourage 
the rea der to review lRecio-Blanco et al.l(l2006l) and Biiaou i~et~al1 
(2008) for a more comprehensive description. 
MATISSE estimates a 0, stellar atmospheric parameter (r e ff, 
log g, [M/H]) by projecting the observed spectrum O(A) on a 



§i = YjB ei (A)-0(A). 



(1) 



These vectors, called Bg(A) functions hereafter, are com- 
puted during a learning phase from a library of synthetic spectra 
that cover the same wavelength range and have the same spec- 
tral resolution and wavelength sampling as the observed spectra. 
They relate in a quantitative way the pixel-pixel flux variations 
in a spectrum to a given variation of the 0,- parameter. If the Bg(A) 
are orthogonal, the effects from each parameter affect the spec- 
trum in a different way, and therefore the atmospheric parame- 
ters are derived accurately. When this is not the case, possible 
degeneracies in parameter space can occur, which cause a corre- 
lation of the parameter errors. The Bg(A) functions are computed 
from an optimum multi-linear combination of theoretical, syn- 
thetic spectra S (A), as follows: 



B Si (A) = J^atj ■ S j(A), 



(2) 



where the a, ; factor is the weight associated with each synthetic 
spectrum S j{A) to retrieve the 6, parameter. To compute these 
weights during the learning phase, Eq. Q] is applied to a subset 
of synthetic spectra. Accordingly, we obtain by combining Eq.Q] 
and|2 



©, = Ca,, 



(3) 



where C - [cjf] is the correlation matrix between S j and S y, 
and ©, as well as a-, the vectors of the parameters 0, and the 
weights a,j for all considered spectra. The weights ary are then 
computed by inverting the correlation matrix C. 

We have as many Bg(A) functions as there are spectra in the 
library of synthetic spectra. Each of them is computed using 
a small parameter range, for which we assumed the parameter 
variations to have a linear effect on the spectral flux variations. 
In practice, to converge to a parameter sub-space, one can use 
either generic B° g {A) functions, or impose a priori the Bg(A), if a 
first estimation of the parameters is availably When applying 
Eq. Q] to an observed spectrum, the method iterates for as long 
as the derived parameters are not included in the considered pa- 
rameter sub-space for which the Bg(A) functions were computed. 
Usually, the final convergence is attained after a few iterations. 

Here, the MATISSE method deals with normalised spectra 
and consequently all the necessary information for deriving the 
atmospheric parameters is provided by the spectral lines, i.e. 
their relative strengths. This strength changes according to the 
spectral type, the luminosity class and the metal content of the 
stars. In noiseless spectra and when the synthetic spectra per- 
fectly match real stellar spectra, all astrophysical information 
(i.e. spectral lines) can be taken into account during the train- 
ing phase of MATISSE. Nevertheless, for noisier spectra or if 
there is a mismatch between the observed and the synthetic stel- 
lar spectra, it is better to consider only the most relevant features, 
and give less weight to second-order variations. 

The possibility of identifying a specific subset of the spec- 
tral signatures that are to be used for an atmospheric parameter 
derivation, which is a particular feature of MATISSE, allows us 
to adopt an optimised strategy according to the SNR and the 



2 The first estimates can come either from photometric measurements 
or from results of other algorithms. 
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Fig. 1. Illustration of the MATISSE basis functions. For every set of parameters 0, , corresponding to a spectrum in a grid of synthetic 
spectra, a set of Bg(A) was computed. Here, we can see the computed Bt m (A) functions (and the corresponding synthetic spectrum) 
for a star with r e ff=5250 K, log g=4.5, [M/H]=-0.75 dex, [a/Fe]=0.3 dex. For clarity we represent only a part of the wavelength 
range in the plot. The strongest features in the spectrum (top panel), identified with dashed vertical lines, correspond to the Ca n 
triplet. In the second row we show the Bj cK (X) computed with a direct inversion of the correlation matrix, whereas the last two 
panels show the Bj cS (A) functions computed with the Landweber iterative algorithm, imposing a coefficient of correlation between 
the input and the output parameters of 0.97 and 0.80, respectively. As described in Sect. 12.11 the smaller the coefficient, the less 
weight is given to the second-order variations of the spectral flux. The spectral analysis can therefore be optimised according to the 
quality of the spectrum, which is typically quantified in the signal-to-noise ratio. 



type of the star, and therefore to optimise the analysis according 
to the quality of each spectrum. This optimisation is achieved 
when computing the Bg(A) functions, and more precisely while 
inverting the correlation matrix C of Eq. [3] A direct inversion 
would take into account all the n-order variations caused by the 
parameters. If one only considers the first order variations of the 
spectral flux though, one has to approximate the inverse of C. 
The degree of approximation can be controll ed by using an iter - 
ative algorithm, such as that of Landweber (lLandweberlll95lh . 
In that case, one can impose a correlation coefficient between 
the input and the output parameters to be equal to the desired 
degree of approximation. The higher the factor of correlation, 
the lower the degree of approximation. A correlation coefficient 
of one would be similar to a direct inversion. 

In practice, the correlation matrix can sometimes be ill- 
conditioned, and can therefore imply many near-zero eigenval- 
ues. The Landweber algorithm adapts the inversion to the matrix 
conditioning, in the sense that the first eigenvectors are inversed 
during the first iterations, and the inversion of the smallest ones 
need more iterations to be accomplished. The iteration number 
is linked with the congelation coefficient cited above. 

We took the grid of the synthetic spectra described in 
Sect. 13.11 stopped the iterations for the different values of 
correlations of 0.75, 0.80, 0.90, 0.95 and 0.98, and computed the 
Bg(A) functions with the respective approximations of C . This 
is illustrated in Fig.Q] where we plot the same Bj ca {A) function 
computed with different correlation factors. As we notice in this 



figure, all minor features of the spectrum are given a very low 
weight for a coiTelation factor of 0.80 (bottom panel of Fig. [TJ. 
These approximated Bg(A) functions are fully justified as long 
as the spectra are of low quality, where these minor features are 
anyway lost in the noise. 

Application of the various approximated Bg(A) functions on 
noisy synthetic spectra allowed us to select the most suitable 
combination, as a function of spectral type, metallicity, and SNR 
(see Sect.gj). 

2.2. DEGAS: an oblique k-d decision-tree method 

In the limit of the sampling precision, the parameter estimation 
is a recognition problem. The grid of synthetic spectra can be 
treated as a known set of patterns among which we aim to iden- 
tify the observed spectra. In the learning phase, the recognition 
rules are established using the grid of theoretical spectra. 

Decision trees are commonly used for data mining ( Ou inlanl 

1993). At each tree node, a decision is taken to split the data sub- 
set into two or more subsets. The leaf level corresponds to the 
identified classes. A k-d tree is a basic space-partitioning struc- 
ture in a k-dimensional space (in our case, k=3, T e g, log g and 
[M/H]). The decisions result from the projection of the observa- 
tions on a node vector. Classical, or axis-parallel decision trees, 
check only one variable at each node. In the particular case of 
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oblique decision trees, the node vectors (called D n hereafter) are 
obtained from a linear combination of the structural features (i.e. 
the atmosp heric parame ters), resulting in a simpler and more ac- 
curate tree (White 2008). 

The DEGAS (DEcision tree alGorithm for Astrophysics) al- 
gorithm is an oblique k-d decision tree, for which a prelim inary 
version has already been presented in lBiiaoui et al.l yOlO). The 
recognition rules at each node are built during the learning phase 
as follows: 

1 . The mean vector M of the flux values per pixel is computed. 

2. For each spectrum Sj associated to the node, we calculate the 
scalar product c ; = Sj ■ M. Let c be the median value of cj. 

3. The data are bisected into two subsets, T\ and 7*2, according 
to the following criteria: 

S j belongs to the subset T\ if cj < c 
S j belongs to the subset T% if Cj > c 

4. The mean vectors Mi and M2 of each subset are then com- 
puted, and the difference vector D = Mi - M2 is determined. 

5. According to the Huygens theorem, the best separation be- 
tween T\ and T2 corresponds to the case for which the dis- 
persions around Mi and M2 are minimal, which is when M 
and D are parallel. Indeed, we examined the variation V of 
the dispersions that resulted from spectra exchange between 
the two sets T\ and T%. Because the V expression introduced 
the vector D, we chose to get M and D parallel in order to 
maximise V. This dramatically improved the separation com- 
pared to the use of M. Hence, in practice, if the angle be- 
tween M and D is too wide (correlation coefficient smaller 
than 0.999), we iterate until convergence, re-separating the 
data by the hyperplane defined by D (going back to step 2, 
replacing M by D). 

Once this procedure has converged for a particular node n, 
we determine the final projection node vector D„, which will 
display the features that allow one to separate the data at that 
node, and the final median value c„ of cj = Sj ■ D n . 

In this way, the recognition tree with log2(A0 levels is built, 
where N is the number of spectra of the training grid. At the 
lowest level nodes of the tree, the leaves, only one training spec- 
trum remains associated with each node. During the application 
phase, the target data Oj passes through all levels of the recogni- 
tion tree, and a template is associated to it. 

Nevertheless, noise can induce possible misclassifications. 
At each node one has to take into account that the projection co- 
efficient Cj = Oi • D n is distributed according to a Gaussian law, 
defined for the entire real axis. So, theoretically both branches 
can be chosen, and the algorithm should fully explore the deci- 
sion tree. This would lead to an inefficient method. To allow the 
assignation of a probability to each of the directions, we chose 
to replace the Gaussian distribution by an Epanechnikov kernel, 
which corresponds to a truncated parabola. Let us consider 



CTc, 

where cr Ci = «b ■ ■\Jl, / {D, l (A) 2 . If m, < —k we decide that the 
correct direction is 1. If > k direction 2 is chosen. If —k < 
u\ < k both directions are considered. Owing to the threshold, 
generally only one direction is chosen, and in the end only few 
leaves are selected. 

After scanning all nodes, a subset of synthetic templates is 
selected, and the distances of the templates from the observed 



Table 1. Atmospheric parameters of the synthetic spectra of the 
learning grid. 





Range 


Step 


Teg 


3000 ; 8000 K 


200 K between [3000;4000] 
250 K between [4000;8000] 


l°gg 


0.0 ; 5.0 


0.5 dex 


[M/H] 


-5.0 ; +1.0 dex 


1.00 dex between [-5;-3] 
0.50 dex between [-3;-l] 
0.25 dex between [-!;+!] 



spectrum are computed. Then, a weighted mean is evaluated on 
the parameters taking into account these distances, setting 

wf = a-|o 1 -s n | 2 y. (5) 

The value p of the polynomial exponent is fairly arbitrary. If 
the noise is important, the distances between the selected tem- 
plates and the observed spectrum are quite similar, and therefore 
a strong exponent is needed to put more weight to the most sim- 
ilar solution. We tested several values (p = 16, 32, 64, 128) on a 
set of noisy synthetic spectra and decided to use p — 64 through- 
out. 

3. Creating the training and testing spectroscopic 
set 

As noted earlier, most automatic spectral analysis methods, e.g. 
artificial neural networks or maximum likelihood algorithms, re- 
quire a library of synthetic or observed spectra with well-known 
parameters for their learning phase. These reference spectra have 
to cover the whole parameter space T e s, log g, [M/H] where 
the selected targets are expected. No available observed libraries 
with those characteristics that covered our wavelength range 
with sufficiently accurate parameters were available in our case. 
We therefore chose to compute a synthetic library to fulfil our 
objectives. 

3.1. Grid of synthetic spectra for the learning phase. 

A library of synthetic spectra spanning the parameter space 
as detailed in Table Q] was comput ed using MARCS model 
atmosphere s (Gustafsson et ai] l2008l) and the Turbospec code 
varez & Pleall998l and further improvements by B. Plez). 
We assumed a coupling between the overall metallicity and the 
a-element abundances^ according to the commonly observed en- 
hancements in metal-poor galactic stars. We considered 

- [a/Fe]=0.0 dex for 0.0 < [M/H] < +1.0 dex 

- [a/Fe]=+0.1 dex for [M/H]=-0.25 dex 

- [a/Fe]=+0.2 dex for [M/H]=-0.50 dex 

- [a/Fe]=+0.3dexfor[M/H]=-0.75dex 

- [a/Fe]=+0.4 dex for [M/H] < -1.0 dex. 

Each spectrum was computed assuming hydrostatic and lo- 
cal thermodynamic equilibrium (LTE), covering the wavelength 
range 8390-8860 A with a wavelength step of 0.02 A. For stars 
with 3.5 < log g < 5.0 (cgs units), plane-parallel models were 
used, with a microturbulence parameter £=1 km s _1 . For giant 
stars (log g < 3.0), spherical symmetry, with £=2 kms -1 was 
preferred. These models, for which the sphericity effects may 

3 The chemical species considered as or-elements are O , Ne , Mg , Si , 
S , Ar , Ca and Ti . 
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be considerable for low gravities (see iHeiter & Erikssonl l2006). 
have been calculated for a mass of IMq. The final library con- 
tains 2905 spectra of 23501 pixels. Let us note that a few mod- 
els are missing in the synthetic library owing to the approach 
to the Eddington flux li mit or poor convergence, as described in 
iGustafsson etaTI d2008l) . 

This paper deals wit h spectra that will be obtained with the 
LR8 setup of FLAMES (Pasq uini et al.ll2003l) . Therefore, this li- 
brary had to be adapted to the observational setup. The spectra 
were restricted to the wavelengths 8400-8820 A, which contain 
all the predominant lines, and we trimmed out spurious effects 
in the CCDs (border effects, presence of a glow in the red part) 
and possible sky residuals. For this reason, the range between 
8775 A and 8801 A, which contains few iron lines and possibly 
has important sky residuals, was also removed. The spectral fea- 
ture corresponding to Mg i around ~8807 A was kept though. 
The spectra were then convolved with a Gaussian kernel and re- 
binned to match the sampling and resolution of the LR8 spectra. 
In addition, eight re-sampled pixel^] corresponding to the cores 
of the strong Ca n lines were also removed from the spectra (two 
pixels for the first line, and three for the other two lines, corre- 
sponding to 0.8 and 1.2 A, respectively), because a disagreement 
is expected between the synthetic spectra and the true ones for 
these pixels. Indeed, the cores of strong lines have a significant 
contribution from the upper layers of the stellar atmospheres, 
where LTE, one of the assumptions of the model atmospheres 
which were used, is not applicable any more. Removing them 
will accordingly avoid possible biases to the final parameter ex- 
traction. We checked this with the tests presented in Sect. [4] The 
final spectrum contains in the end only 957 pixels with a sam- 
pling of 0.4 A. 

3.2. Line-list calibration 

In order to compute realistic synthetic spectra, it is necessary 
to know the atomic and the molecular parameters of the exist- 
ing lines as accurately as possible . Whereas central wavelengths 
and excitation potentials are fairly well established for most of 
the atomic lines, the probabilities of electronic transitions (il- 
lustrated by the oscillator strengths log gf) are more subject to 
uncertainties. The combination of a line-list, a set of model at- 
mospheres and a spectral synthesis method has to be calibrated 
on standard stars of different spectral types to check the line-list 
quality and obtain reliable synthetic stellar spectra. 
For that purpose, we u sed the set extract ed from the VALD 
database^ in May 2009 (iKupka et al.l l2000) as an initial atomic 
line-list and combined it with MARCS model atmospheres to re- 
prod uce the observed high r esolution, high SNR spectra of the 
Sun dBrault & Neckellll987l) and Arcturus dHinkle et al.1 120031; 
lAllen de Prieto et alj|2004l) as precisely as possible . 

All radiative transfer computations were made assuming 
LTE and hydrostatic equilibrium, and therefore no effort was 
made to fit the cores of strong spectral features better (e.g. Can 
triplet or Mg i line). Instead we removed the pixels correspond- 
ing to the cores of the Ca n triplet as described in the previous 
section. 

The VALD atomic line-list set wa s first modified, a d opting 
the oscillator strength corrections bv IGustafsson et al. (2008) 
for some lines. The molecular line-list includes ZrO, TiO, VO, 
CN, C 2 , CH, SiH, CaH, FeH and MgH lines with their corre- 



4 This corresponds to less than 1% of the total pixels in the spectrum. 

5 http : / /vald . astro . univie . ac . at/ 
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Fig. 2. Input parameters for the testing set. The spectra were 
computed based on a catalogue of pseudo-stars requested from 
the website of the Besancon Galaxy model. 



sponding isotopic variations (kindly pr ovided by B. P lez). The 
adopted solar abundances were th ose of | Grev esse (2008), except 
for CNO, in which case we used[Asplund et al. (2005). In addi- 
tion, we considered £=1 km s _1 and V sin i=2 km s _I . We then 
calibrated manually more than 250 lines to match the observed 
solar spectrum, checking that the overall y 2 between the syn- 
thetic template and the observed spectrum was decreasing with 
each adjustment. 

In addition, roughly 50 lines were calibrated on Arcturus un- 
der the condition that this calibration did not increase the overall 
X 1 for the Sun. We also note that the calibration process did not 
consider a correction of the molecular bands, even though some 
incorrect minor features were noticed. 

For Arc t urus w e used the parameters and the abundances of 
Smit h et alJ (l2000l) : r eff =4300 K, log g=l.l, [M/H]=-0.6 dex, 
[a7Fe]=+0.3 dex, Vsinj=1.5 kms -1 , macroturbulence pa- 
rameter 77=5.2 km s _1 with a radial-tangential profile, and 
£—1-7 km s ' 1 . We note though that the observed spectrum of 
Hink le et al .1 (I2003h was already normalised, but had an imper- 
fect normalisation around the second line of the Ca n triplet, no- 
ticeable as clearly asymmetric wing strengths. 

Finally, because of the fairly poor constraints on its atmo- 
spheric parameters, no line-list calibrations additional to those 
made from the Sun and Arcturus were made on Procyon A 
(spectral type F5-IV). Although we checked if the previ- 
ously made calibrations were reproducing the Procyon observed 
spectrum correctly, which is available from the S 4 N library 
( Allende Priet o et al.1 l2004h . As expected, we found that the 
overall fitting quality was improved compared to the computed 
spectrum using the non-calibrated line-list of VALD. 

3.3. Testing set 

In order to test the MATISSE and the DEGAS algorithms, we 
required a set of synthetic spectra that did not sit on the learning 
grid nodes. We used the specificities of the local Bg(A) functions 
of MATISSE associated with a synthetic spectrum S o(A) with O * 
parameters to compute a set of interpolated spectra. Indeed, the 
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variations from a grid's spectrum Sq(A) with a k parameters can 
be computed according to the following expression: 



(6) 



k=\,K 



k'=\,K 



where B kk > is the correlation matrix between the basis vectors, 
defined as 



(7) 
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We used Eq.[6]to compute a synthetic spectrum of the Sun, and 
compared it to the observed one. The spectra agreed well, vali- 
dating in this way the interpolation routine. 

To test the methods efficiently, these interpolated spectra 
have to represent realistic cases, i.e. they have to follow a 
plausible H-R diagram. For that purp ose, we made us e of the 
Besancon model of the Milky Way (Robi n et alj|2003l) . which 
returns atmospheric parameters of simulated stars towards a 
given line-of-sight. We queried from the website of the Besancon 
mode@ simulated mock catalogues of stars towards the galactic 
bulge, the north galactic pole and towards intermediate latitudes 
(Z = 245°, b = 45°) to model the range of stellar parameters en- 
countered in different galactic stellar populations. 10 4 of these 
stars were selected randomly, to create our testing sample (see 
Fig. [2]). For each of these pseudo-stars, two different metallicity 
values were considered to obtain a catalogue spanning the whole 
metallicity range: the one given by the model of Besancon, and 
another lowered by -0.75 dex. The 2 10 4 spectra of these pseudo- 
stars, were then interpolated from the learning grid using Eq. |6l 
at the sampling and resolution of the FLAMES LR8 spectra. 

Finally, to test the robustness of the methods, four different 
values of white Gaussian noise (SNR -10, 20, 50, 100 pixeF 1 ) 
were used to degrade these spectra, raising the final number of 
testing spectra to 8 10 4 . 



4. Individual performances of the two methods 

We considered that no photometric information is available for 
the stars whose spectra we analysed. We therefore aimed to ex- 
plore the performances of each method without additional data. 
We recall that often this is not true. In particular, this will not be 
the case for the Gaia space mission, because the two spectropho- 
tometers BP/RP together with astrometric information will pro- 
vide a handful of information, constraining that way the allowed 
ranges for T e g and log g. 



4. 1 . Performance of MATISSE 

We ran MATISSE with the first solution found with the B®(A) 
functions as input (i.e. without any a priori information) on 
the 8 10 4 synthetic spectra of the testing set described in 
Sect. I3.3I Figure [3] and Table [2] show, respectively, the evo- 
lution of the shape of the H-R diagram and the relative er- 
rors at 70 % of the error distribution, as a function of the 
SNR for different types of stars. In the left column of Fig. [4] 
we represent the evolution of the typical errors according to 
the SNR for thin disc (-0.25 <[M/H]< 0.5 dex) and thick 
disc (-1.5 <[M/H]< -0.25 dex) dwarf stars and halo giants 
(r eff < 6000 K, log g <3.5, -2.5<[M/H]<-1.25 dex). These val- 
ues (also represented in the last lines of Table |2| were obtained 
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Fig. 3. Dependence of the H-R diagram for spectra analysed 
with the MATISSE algorithm as a function of signal-to-noise 
ratio. The colour code, quantifying metallicity, is the same as in 
Fig. [2] As expected, stars with lower metallicities and hence less 
spectral information are those with the greatest scatter. 



for the above mentioned testing set and the best Bg(A) functions 
combination (see Sect. I2.lt . Two main conclusions can be de- 
rived: 

- Good results are obtained down to SNR~20 for spec- 
tra where a lot of spectral information is available, typi- 
cally, metal-rich dwarfs. The errors increase with decreas- 
ing metallicity (see Table |2). For GV type stars (dwarfs, 
r eff > 5000 - 6000 K), with SNR-50, the internal errors 
on T eS rise from 90 K for metal-rich ([M/H]> -0.5 dex) to 
290 K for metal-poor stars (-l<[M/H]<-2 dex). Errors on 
log g are increased from 0. 15 dex to 0.50 dex, and for metal- 
licity from 0.10 dex to 0.21 dex. 

- Two degeneracy regimes occur. The first, between the hot 
sub-dwarfs (T e ff >6000 K) and the giant branch, is caused 
by to the common spectral signatures shared by T e s and 
log g. The second concerns the cool dwarfs and originates in 
the poor constraints on log g available with this wavelength 
range and resolution. 

These two degeneracies are mainly caused by the behaviour 
of the Ca n lines (the strongest spectral signatures) for different 
spectral types. Indeed, considering that pressure dependence can 
be translated into approximate gravity dependences, the wings 
of the Can lines grow propo rtionally to g 1 ^ for cool main se- 
quence stars (see lGravl l2008). but strongly depend on the T e g. 
For dwarfs with 3000 < T e ff < 5000 K, where no Paschen lines 
are developed, the only relevant spectral signatures concerning 
log g come from the weaker metallic lines which, depending on 
the SNR and the global metallicity of the star, can be lost in the 
noise. 

On the other hand, parameter degeneracies (and also the sec- 
ondary minima) that affect hot dwarfs involve very distant loca- 
tions in parameter space. Indeed, if we compare the spectra of a 
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Fig. 4. Relative errors of the stellar parameters derived by MATISSE (left column), DEGAS (middle column), and the final adopted 
pipeline (right column), at 70% of the error distribution as a function of the signal-to-noise ratio. Intermediate temperature (5000 < 
r eff <6000 K) thin disc (-0.25 <[M/H]< 0.5 dex) and thick disc (-1.5 <[M/H]< -0.25 dex) dwarf stars are represented as black 
diamonds and red triangles symbols, respectively. Halo giant stars (7^ < 6000 K, log g < 3.5, -2.5<[M/H]<-1.25 dex) are blue 
squares. 



star with r eff =6500 K; log g=4.5; [M/H]=-1.0 dex and another 
that is significantly cooler and more giant, with r e ff=5500 K; 
log g=3.0; [M/H]=-1.5 dex (see Fig. [5j, we can see that they 
are almost identical. Only the cores of the strong lines (calcium, 
magnesium) are different, and their wings differ weakly. In ad- 
dition, the few Paschen lines that both spectra exhibit are almost 
identical (only Pal2, around 8750A, is significantly different). 
This additionally supports our removal of the cores of the Ca n 
lines in our automated analysis. Indeed, our tests have revealed 
that avoiding the Ca n line cores in the reference grid helped to 
decrease the number of secondary minima. The disagreement 
between the synthetic templates and the true spectra, in addition 
to the effect of noise on these line-core pixels, can induce impor- 
tant biases, because spectra with similar cores will have a lower 
overall x 1 - The price to pay to avoid these biases is a bigger 
dispersion in the final parameter estimates. This effect becomes 
even more critical when the metallicity decreases, owing to the 
lack of spectral information. 

The absence of distinct spectral signatures can also be seen 
from the angles between the Bg(A) functions at different regions 
of the H-R diagram. We recall that if the Be(A) functions are 
perpendicular, the parameters can be perfectly derived, because 
they use different spectral signatures. For example, a star with 
r e ff=6500 K; log g=4.5; [M/H]=-1.0 dex has an angle between 
Jeff and log g of 33.4°, 66.5° between [M/H] and r eff , and fi- 
nally 89.2° between log g and [M/H]. We can therefore expect 
a degeneracy between the T e g and log g determination, and a 



milder one between T e fi and metallicity, as we noticed with our 
testing set. 

On the other hand, for the cool dwarfs, inaccurate log g 
derivation is only caused by the lack of spectral sensitivity, 
and not because the spectral signatures are shared between the 
parameters. Indeed, for a star with r e fj=4500 K; log g-4.5; 
[M/H] =-1.0 dex the angle between r eff and log g is 89.7°, be- 
tween [M/H] and T e g is 82.9°, and finally between log g and 
[M/H] is 109.9°. 

In terms of^ 2 between the MATISSE parameter solution and 
the true reference values, the secondary minima for cool dwarfs 
lie close to the parameter space formed by T e ff; log g; [M/H] 
and consequently MATISSE still derives accurate parameter es- 
timates, even at low SNR. For hot dwarfs the low-SNR estimates 
are farther from the true parameter values, resulting in higher 
overall errors. Adding noise to the signal will accentuate this be- 
haviour, increase the number of secondary minima, and hence 
decrease the accuracy of MATISSE. 

In addition, Fig. [3] shows that some stars are estimated to 
have log g >5. This effect is more important for low-metallicity 
stars, but also affects metal-rich stars at low SNR. This effect, 
which is typical of the projection algorithms, appears because 
MATISSE may perform extrapolations in parameter space to ob- 
tain the atmospheric parameters. Indeed, the Bg(A) functions on 
which the spectra are projected, still attach too much weight to 
certain spectral signatures even when they are optimised to a 
spectral type and an SNR value, which enhances the sensitiv- 
ity to noise. Of course, results lying outside the grid boundaries 
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Fig. 5. Noise-free synthetic spectra showing the degeneracy in 
parameter space between a hot dwarf star (black, solid line) and 
a cool giant (red dotted line). The spectra differ significantly only 
at the cores of the strong lines and at Pa 12 (~ 8750 A), but their 
wings differ weakly. 
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have to be considered as unreliable and need to be treated sep- 
arately (see Sect. 14.41 for more details on how these values are 
finally treated). 

The correlation of the parameter errors is non-negligible, as 
seen in Fig. [6] which illustrates again the effects of the non con- 
vexity and the degeneracies of the distance function. Indeed, we 
found the well-known r e ff-[M/H] correlation as well as the cor- 
relation between log g - T e s- The correlation between log g 
and [M/H], however, is almost insignificant for metal -rich and 
intermediate-metallicity stars, down to SNR~20. 

4.2. Performances of the DEGAS algorithm 

Unlike MATISSE and its local B g (A) functions, the DEGAS 
method allows a more global view of the parameter space, and 
therefore, it is less affected by local minima traps. Accordingly 
degradation of the parameter accuracies is slow with decreas- 
ing SNR (see Table [3] and the middle column of Fig. 0}. The 
critical SNR value for which DEGAS gives better results than 
MATISSE is around SNR-35. Above that value, DEGAS is in- 
capable of inferring an appropriate interpolated parameter com- 
bination between the grid points (Pig. O because Eq. [5]assigns 
too much weight to the resulting leaves of the decision tree. 
However, the closest grid point (absolute minimum) is usually 
found. Of course this effect results in relatively higher errors 
compared to those of MATISSE. This feature cannot be im- 
proved without affecting the results at lower SNR. 

Nevertheless, even at higher SNR, the pattern recognition ap- 
proach deals better with spectra having low metallities ([M/H] < 
-1 dex). For instance, at SNR-50, a G-type giant with [M/H] < 
-2 dex is expected to have errors of 182 K, 0.43 dex, 0.22 dex 
for J e ff, log g and [M/H], respectively. These accuracies are 
35% better than those obtained with MATISSE. Furthermore, 
the same effect can be observed for KV metal-poor type stars, 
resulting in a (partial) resolution of the thickening of the cool 
part of the main sequence seen with MATISSE (Fig. [3]). 

DEGAS gives quite good results down to SNR~20 for stars 
with [M/H]> -1 dex. Typical errors for giants are 121 K, 
0.29 dex, 0. 13 dex for T e g, log g and [M/H], respectively. These 
values are ~ 168 K, 0.30 dex, 0.14 dex for dwarfs, and hotter 
dwarfs have lower accuracies compared to cooler ones. 
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Fig. 6. Error correlations for MATISSE (left column) and 
DEGAS (right column) for spectra with SNR~20. The colour 
code, which corresponds to the metallicity, is the same as in 
Fig. [2] The general shape of these figures shows the degener- 
acy between the atmospheric parameters. For higher SNRs the 
correlation slopes are similar, but less extended, because the er- 
rors are smaller. The specific properties of each analysis method 
lead to the somewhat different extension and dispersion of the 
correlations. 



Let us stress though that an additional reason why this 
method gives smaller errors compared to MATISSE is that the 
derived parameter values cannot lie outside the grid boundaries. 
This is also illustrated when comparing the dispersion of the er- 
ror correlation for the two algorithms in Fig. [6] However, the 
general shape of the correlations has not changed, as expected, 
because the physical degeneracies are impossible to separate for 
any of the methods. 

4.3. Errors caused by the radial velocity estimation 

The automated stellar parameterisation relies on a good radial 
velocity correction of the observed spectra to the rest frame 
where the synthetic templates are calculated. It is therefore nec- 
essary to test the robustness of each method in the case where 
the spectra are not perfectly at the rest frame. 

To this purpose we used the random set of spectra presented 
in Sect. E3 with SNR- 100, 50, 20 and 10 pixeF 1 , and intro- 
duced four different values of Doppler shifts ( V m d =5, 7, 10 
and 15 km s _1 , corresponding to 1/3, 1/2, 2/3 and 1 pixel shift). 
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Fig. 7. Dependance of the H-R diagram structure recovered with 
the DEGAS algorithm as a function of the spectrum's signal-to- 
noise ratio. The colour code is the same as in Fig. [2] 



We then ran the two algorithms with these spectra and checked 
the resulting error distribution for each parameter. In Fig. [8] we 
show the relative errors obtained for the different radial velocity 
shifts for an SNR=50 and for both methods. The results show 
that the spectra need to be corrected to better than half a pixel 
(corresponding to a V m d accuracy of less than ~ 7 km s~') to ob- 
tain estimates that are still similar to those obtained for spectra 
at the rest frame. Most of the times, this condition is fulfilled, 
because the errors commonly obtained on V ra d for FGK stars at 
SNR~20 in the LR8 setup are less than 5 km s (see the com- 
panion paper Kordopatis et al. 201 lb). However, the atmospheric 
parameter errors, induced by a non-perfect V ra d correction, tend 
to increase more rapidly when the metallicity decreases. This is 
because the few spectral signatures present in the signal have a 
relatively small equivalent width (the spectral lines are spread 
on only a few pixels), and even at high SNR their misplacement 
have a serious effect on the parameter estimations. 

Furthermore, good estimates are obtained for both methods 
for shifts up to 2/3 of a pixel at any SNR for intermediate- to 
high-metallicity stars. We point out though that DEGAS is in 
general more stable than is MATISSE when dealing with spectra 
that are not in the rest frame, especially for metal-poor stars. 

At lower SNR (< 20) both algorithms are stable to Doppler 
shifts and show a similar behaviour. Indeed, for low-quality 
spectra the errors induced by noise dominate those arising from 
a poor radial velocity correction. 

4.4. Summary: adopted strategy 

We decided to set up a procedure that would combine the bene- 
fits of MATISSE (very good results in a local environment, easy 
interpretation) and DEGAS (global view and better tackling of 
the secondary minima problem). The MATISSE algorithm gives 
better results if there are no significant secondary minima in the 
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Fig. 8. Sensitivity to radial velocity errors for MATISSE (left) 
and DEGAS (right). The values at V ra( j = km s _1 correspond 
to the values found in Fig. [4] Additional errors caused by an 
incorrect V rat j correction are almost null as long as the Doppler 
shift remains lower than 7 km s -1 . For lower SNR the robustness 
of the algorithms is higher, which has an almost negligible effect 
even for an extreme velocity error of V ra d=15 km s _1 . 



distance function, because of the local application of its learn- 
ing Be(A) functions. In the particular case of low-resolution IR 
Can spectra, this condition is achieved for data with an SNR 
value higher than ~ 35 and global metallicity [M/H] > -2.0 dex. 
Nevertheless, the use of B° g {A) is not the optimal approach to ob- 
tain the best possible results. Indeed, the number of local min- 
ima tends to increase with noise and the lack of spectral signa- 
tures. Given the properties of the parameter space around the 
Can region, the generic B° g (A) functions will not provide the 
absolute minimum and hence will accentuate the degeneracies. 
Therefore, to fully exploit the capabilities of MATISSE, we de- 
cided to use the results of DEGAS as an initial input. This op- 
eration reduces the effect caused by the secondary minima, and 
can therefore provide more accurate results (up to 50% accuracy 
improvement for and log g). 

For lower SNR (less than 35) and very metal-poor stars, the 
non-convexity of the distance function is too severe to infer good 
results with a projection method such as MATISSE. The use of 
DEGAS is therefore preferred. Indeed, DEGAS can derive errors 
lower by 75% than those of MATISSE for a low-SNR metal-poor 
spectrum. 
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Finally, as seen in Fig. [3] and Fig. [7j because of the non- 
convexity of the distance function, both methods can return re- 
sults in regions of the H-R diagram where no stars can exist. 
Removing these parameter combinations from the possible so- 
lution space will therefore reduce the degeneraci es and increase 
the method accuracy. Based on the Y 2 isochrones dYiet al.l2001l) 
of stars spanning ages from 0.25 Gyr to 15 Gyr and metallici- 
ties from -3 dex to +0.8 dex, we decided to exclude the Bg(A) 
functions and the grid points used for DEGAS, where no pos- 
sible isochrones were nearby. In practice, to avoid too impor- 
tant astrophysical priors in the derived parameters, we removed 
only the templates with log g-5 and r e ff > 6250 K, those with 
r e ff < 4250 K and 4 < log g < 3 dex, as well as all stars with 
[M/H] < -3 dex, r eff < 4000 K and log g < 4 dex. We checked 
in Sect. l5.2l that this selection did not introduce biases in the final 
results. 



5. Final adopted pipeline 

Based on the results and the discussion in the above sections, we 
adopted a final pipeline. This procedure took into account the 
following points: 

- The applied parameterisation method has to be chosen ac- 
cording to its optimal application conditions defined in 
Sect, [ft] 

- Spectra normalisation (when the data are not flux-calibrated) 
is coupled to the atmospheric parameter determination. 

- The parameterisation algorithms are optimised for a given 
SNR value, which implies a robust SNR measurement. 

However, the SNR determination itself depends on the spec- 
tra normalisation and the atmospheric parameter estimation. The 
final adopted pipeline therefore iterates on the parameter estima- 
tion until convergence of the normalisation solution and the SNR 
measurement, as described in the following steps: 

1 . A rough sigma-clipping, polynomial fit normalisation is per- 
formed and an initial value of the SNR is assumed that is 
constant for all the spectra (we adopted an intermediate value 
of SNR ~50). This will give us a first rough estimation of 
the parameters using DEGAS to compute a synthetic spec- 
trum with the parameters of the first guess. Using this tem- 
plate, the SNR is estimated for the first time, as described in 
Sect. ED 

2. DEGAS is re-applied with the newly found SNR values. The 
new atmospheric parameters are then used to compute a syn- 
thetic template and re-normalise the input spectrum. This 
step is repeated several times to obtain a normalisation as 
perfect as possible (see Sect. 15.21 ). Convergence is achieved 
when the continuum shape is unchanged compared to the 
previous iteration. 

3. The SNR is re-computed for the final normalised spectra. If 
its value is higher than 35 and the [M/H] value determined 
by DEGAS is higher then -2.0 dex, MATISSE is run with 
the set of Bg(A) functions corresponding to the spectral type 
and the SNR value. The results of DEGAS from the previous 
step are kept for low-SNR or very metal-poor stars. 

4. For other stars, the final atmospheric parameter estima- 
tions are obtained when the convergence of MATISSE is 
achieved. If the MATISSE results are beyond the parame- 
ter grid boundaries shown in Table Q] the results of DEGAS 
are taken. The final SNR is derived only at the end of this 
step, with a template corresponding to the adopted stellar at- 
mospheric values. 
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Fig. 9. Sensitivity of the recovered H-R diagram for simulated 
data to the spectrum signal-to-noise ratio using the final adopted 
pipeline. The colour code is the same as in Fig. [2] 



5.1. Measurement of the signal-to-noise ratio 

The SNR is estima ted following the procedure described in 
IZwitter et alJ ([2008). First, the atmospheric parameter guesses 
obtained by a given method are used to obtain a synthetic spec- 
trum with the same atmospheric parameters. The difference be- 
tween the template and the observed spectrum is then computed, 
keeping only the pixels where the difference changes sign from 
the previous or the next adjacent pixel and has a relative flux 
close to 1. This selection criterion allows us to keep only the 
continuum pixels, where the difference is caused by pure noise, 
and avoids the selection of pixels that are affected by systematic 
effects (for example due to a bad template selection). This dif- 
ference is then divided by the theoretical spectrum and the SNR 
estimate is obtained by taking the inverse of its standard devia- 
tion. 

As we previously pointed out, the measurement of the SNR 
strongly depends on the normalisation and the synthetic tem- 
plate selection. In addition, the selection of the synthetic tem- 
plate depends on the accuracy of the results of the used method 
(selected Bg(A) functions for MATISSE or the considered leaves 
for DEGAS), which also depend on the SNR. The redundancy 
of the problem therefore requires several iterations before a con- 
vergence of the values can be achieved. For that reason, the SNR 
needs to be estimated several times during our procedure to ob- 
tain a good normalisation (see Sect. 15.21 ). 



5.2. Normalisation effects 

Because the parameter derivation is very sensitive to the relative 
depths of specific lines, a poorly-determined pseudo-continuum 
shape can have sizeable effects. Normalisation is therefore a 
matter of prime importance and special care must be taken. 
Comprehensive tests were made to minimise this effect as much 
as possible. 
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The final adopted procedure consisted of running DEGAS 
several times, renormalising the initially roughly normalised 
spectrum after each iteration. To accomplish this, we use a few 
consecutive iterations to fit the shape of the ratio between the 
interpolated spectrum and the observed spectrum with a fifth- 
order polynomial and with symmetric rejection criteria (clip- 
ping away points farther than 0.5 <x). The continuum shape of 
the observed spectrum is then re-adjusted according to the shape 
of the fitted function. 

To test the robustness of our procedure, we performed var- 
ious tests with the random spectra grid described in Sect. 13.31 
We modified the spectral shape (and hence the continuum) up 
to 3% by multiplying the spectrum with either a slope, a third- 
degree polynomial, or a fifth-degree polynomial. The amplitude 
of 3% is considered to be a pessimistic case for roughly nor- 
malised spectra corrected for instrumental effects. The tests were 
also run on the perfectly normalised spectra, passing through the 
whole pipeline to verify that our procedure is not degrading the 
already perfectly normalised input spectra (see the left column 
of Fig. |U Fig.|9]and Table|4|. The results show that this strategy 
is quite rapidly converging to a stable solution which minimises 
the errors caused by normalisation independent of the deforma- 
tion of the continuum. In addition, as we show in Fig. [10] a good 
re-normalisation is obtained even for the cool metal-rich stars. 
Indeed, in that case the large wings of the spectral lines and the 
huge amount of atomic and molecular lines can be a challenge to 
find the continuum. An unchanged continuum shape is achieved 
after a few iterations. The number of needed iterations depends 
on the SNR, the departure from the true continuum and the spec- 
tral type. Practically, this procedure is performed ten consecutive 
times and convergence is checked. 

It can also be noticed that in some cases, this pipeline derives 
the parameters with even better accuracies than the better of the 
two individual methods (Tables |2]|3] and @). This is because the 
Bg{A) are not used any more, which better constrains the param- 
eter sub-space, as already discussed in Sect. 14.41 Another factor 
that explains the better results is that the final adopted pipeline 
does not consider grid points in unphysical regions of the H-R 
diagram. Nevertheless, the extrapolation properties of MATISSE 
still derive stellar parameters with "unphysical" values in some 
cases, as seen in the two top plots of Fig. [9] 

Finally, the method is robust to deformation of the contin- 
uum, with errors on T^, log g and [M/H] obtained after the 
re-normalisation procedure almost identical (within 10%) with 
the errors obtained for perfectly normalised spectra. 

6. Application to observed spectra 

The errors estimated in the previous sections and illustrated in 
Table [4] can be viewed as the expected relative errors of the 
method when one compares stars within a catalogue of ob- 
served stars. Nevertheless, differences between model spectra 
and real stellar spectra can potentially introduce systematic bi- 
ases. Therefore we needed to test our pipeline with real data. 

We attempted to recover the atmospheric parameters from 
spectra in observed stellar libraries. It was a challenge to find 
spectra that covered the same wavelength region with a suffi- 
ciently high resolution and good pa rameter estimates. Only tw o 
datasets were found: the S A N fromlAllende Prieto et al.1 (12004). 
and the CFLIB described in IValdes et al.l d2004l) . We smoothed 
the spectra from both libraries with a Gaussian kernel to match 

7 A third-degree polynomial was also tested, showing either equiva- 
lent or poorer results than a fifth-order polynomial. 



our working resolving power and re-sampling, and applied the 
whole pipeline. 

The S 4 N catalogue consists of a survey of 118 stars in 
the solar neighbourhood, therefore these stars are mostly metal- 
rich dwarfs. The covered spectral range is very broad (3620- 
9210 A), observed in a single spectral setup at high-resolution 
(R~ 50 000) and high SNR (> 200). Nevertheless, only 68 spec- 
tra include our wavelength range without any inter-order gap. 
The effective temperatures for this catalogue were obtained us- 
ing Stromgrem pho t ome try and the (B-V) colour index with the 
lAlonso et al.l ( 1 19961 Il999t) calibrations. Furthermore, these T e g 
were confirmed by fitti ng synthetic profiles of the H a and Hp 
lines, as described by Barkl em et al.l (120021) and are therefore 
considered reliable. In addition, the gravities published for these 
stars have been derived from the Hipparcos trigonometric paral- 
laxes and are also very reliable. Nevertheless, one may expect 
differences between physical gravities and gravities obtained 
spectroscopically, especially for cool main-sequence stars, for 
which the isochrones give roughly the same value of log g (see 
Fig. ITTTi. In addition, differences are expected for the determina- 
tion of the metallicities. The narrow spectral range and the poor 
spectral signatures available around the IR Can triplet cannot 
be compared to the plethora of lines present in the whole wave- 
length range of the S 4 N. 

Table|5]shows the biases and the dispersions between our de- 
rived par ameters (plotted in F ig.[TT1in red crosses) and the values 
found in lAllende Prieto et al.l (I2004I) . We find that the biases for 
r e ff, log g and [M/H] are reasonable but not negligible, of the 

order of 108 K, -0.21 dex and -0.08 dex, respectively. The 

dispersions are 145 K, 0.32 dex and 0.09 dex, respectively. Let 
us note though that the biases on [M/H] and log g disappear com- 
pletely if we consider the published values derived only from the 
same wavelength range as we use here (strictly, the Gaia RVS 
wavelengt h range, with results k indly provided by C. Allende- 
Prieto, see Allende Prieto (2008) for a detailed description). 

However, the S 4 N library alone is not sufficient to test our 
method fully. It does not cover the astrophysical parameter space 
well enough to fulfil our goals of testing our pipeline over the 
entire H-R diagram, and especially on intermediate and low- 
metallicity stars. The CFLIB librarj0 consists of ~ 900 high 
SNR spectra that cover the entire wavelength range of 3460- 
9464 A with a spectral resolution of 1 .2 A. This catalogue was 
conceived to cover the whole H-R diagram down to metallicities 
of ~ -2.5 dex, with the purpose to be a testing library for au- 
tomatic synthesis methods. The parameters published for these 
stars, however, are a compilation of independent studies of sev- 
eral authors, obtained by various methods. Hence, some disper- 
sion is expected, and the comparison between the method's re- 
sults with the published parameters of this library needs to be 
performed carefully. 

Together with the 900 spectra of the CFLIB catalogue, we 
used the on-line database of PASTEIJ3 (Soubiran et al. 2010) 
instead of the published values in IValdes et al. I (120041) to ob- 
tain updated values for the relevant stellar parameters. We chose 
then to keep only those spectra for which at least two measure- 
ments were available from different authors, and all stars with 
[M/H]< - 1 de£](i gnoring the previous selection criterion for 
metal poor stars). In Fig.QT]we can see the results obtained for 

8 See http : //hebe . as . utexas . edu/s4n 

9 See http://www.noao.edu/cflib 

10 http : //pastel . obs . u-bordeauxl . fr/ 

11 Results of 15 different authors were considered to obtain the mean 
values to which we compare the results of our pipeline. 
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Fig. 10. Re-normalisation process for a cool metal-rich dwarf star. Plot A) shows in black the theoretical spectrum and in red the 
spectrum with a modified continuum (see Sect. 15.2b . which is analysed with our method. The change applied on the input spectrum 
can be viewed in plot B) of this figure, where the ratio between the theoretical spectrum and the input is shown. A corrected 
continuum shape is obtained at the end of our pipeline, as can be seen in the two bottom plots (C and D). 



the 162 CFLIB stars that fulfil our criteria, where the estimates 
from our pipeline are plotted against the mean values found in 
the literature (black dots). The error bars, if any, represent the 
dispersion in the published values. The scatter found is 171 K, 
0.42 dex and 0.21 dex for T e g, log g and [M/H], respectively. 
Table [5] shows the error dispersions for the whole CFLIB li- 
brary, and separately for the dwarfs (log g >3) and the giants 
(log g <3). No significant biases are found, except for a small 
effect (~0.1 dex) in the metallicity of dwarf stars. It can be no- 
ticed that the stars for which our estimates are the most distant 
from the published values are the ones which had only one as- 
sociated measurement. Therefore, it is difficult to know if the 
reason for these outliers is an error of this method or an error in 
the published values. 

We also tested the pipel ine on the spectra of the Sun 
and Arcturu s . The spectra of lAllende Prieto et al. I J2001 and 
iHinkle et all J2003) were degraded to SNR -100 pixel" 1 , with 
200 noise realisations. For each of the considered cases, 400 
spectra were processed. For the Sun the results are very close 
to the standard solar parameter values, except for the metallicity, 
for which a difference of -0. 1 dex is noticed. Similar parameters 
are derived for the T e ff and the [M/H] of Arcturus, but log g is 
less well derived, perh aps because of the imperfect normalisa- 
tion of the spectrum of IHinkle etail (120031) (Sect. Il2l 

We comment briefly on the apparent bias found in metallic- 
ity for the dwarfs. As we can see from Table [5] a similar bias of 



0.1 dex is found for the Sun, the S 4 N stars and the dwarfs 

of CFLIB. We decided to make no zero-point correction because 
the suspected bias is within the error bars found for the metal- 
licity, while inadequate statistics make it impossible to quantify 
any bias as a function of stellar type. Application of the pipeline 
to Arcturus showed no particular bias on the [M/H] parameter. 
As a consequence, given our fairly arbitrary separation between 
metal-rich and metal-poor stars and between giants and dwarfs, 
we decided not to correct any systematic bias. 

Finally, we tested the pipeline by introducing four different 
values of white Gaussian noise (SNR~ 10, 20, 50, lOOpixeL 1 ) to 
the S 4 N and CFLIB spectra. In addition, to increase the statistics 
of our tests, twenty noise realisations for each spectrum were 
performed for each value of the SNR. Table [6] shows the error 
at 70% of the total distribution for each of the libraries. Good 
estimates are found down to SNR~20, as expected, validating in 
this way our method for the case of observed spectra. 



7. Conclusions 

In the era of large spectroscopic surveys, the automated parame- 
terisation of stellar spectra in the infra-red ionised calcium triplet 
region is a fundamental problem that must be addressed and 
solved. We here presented such a method to derive automati- 
cally values of the effective temperatures, surface gravities and 
overall metallicities of observed stars. Alhough the application 
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Fig. 11. Comparisons between the results of our pipeline and values found in the literature for the S N (red crosses) and CFLIB 
(black dots) stellar spectral libraries. The error bars shown for the CFLIB spectra represent the dispersion of the parameter values 
found in the literature, when available. The red dashed diagonal lines, plotted for an easier interpretation of the results, represent 
errors of + 250 K, 0.5 dex and 0.3 dex for T e g, log g and [M/H], respectively. Typical errors of our pipeline are represented in the 
upper left corner of each plot. 



Table 5. Biases and dispersions for the derived parameters of the 
S 4 N and CFLIB libraries. 



T eff logs [M/H] 

(K) (dex) (dex) 

S*N (all) -108 ± 145 -0.21 ± 0.32 -0.08 ± 0.09 
CFLIB (all) 30 ± 171 -0.04 ± 0.42 -0.05 ± 0.21 
CFLIB (dwarfs) -27 ± 156 0.03 ± 0.26 -0.10 + 0.10 
CFLIB (giants) 91 + 118 -0.05 ± 0.45 -0.04 + 0.21 



Table 6. Derived errors for the S 4 N and CFLIB libraries, at dif- 
ferent signal-to-noise ratios. 
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0.53 


0.15 


233 


0.48 


0.27 


-10 


320 


0.61 


0.29 


299 


0.62 


0.37 



discussed in this paper was to spectra with wavelength cover- 
age around ~ 8500A, the method we derived is easily adaptable 
to any other wavelength range and spectral resolution, provided 
there exists a grid of synthetic spectra with the same characteris- 
tics as the observed spectra. Furthermore, the dimensionality of 
the parameter space can be increased from that discussed here, 
if des ired, by adding the [ar/Fe] chemical ratio dGazzano et al.l 
l2010h . 

We studied two different approaches to the problem of au- 
tomated classification, which led to a new hybrid method that 
combines the two approaches. At high SNR, a projection al- 
gorithm, the MATISSE method is preferred, because of its ca- 
pability to interpolate between the spectra of a synthetic grid, 
its lower errors and the easy interpretation of its output. At 
low SNR, the problem of secondary minima in the solution for 
the best-fit synthetic spectrum is better treated with DEGAS, 
the algorithm based on pattern recognition. A quantification 
of the internal errors showed that results that are sufficiently 
accurate for galactic archaeology surveys which aim to study 
the full metallicity distribution (<T[m/h] <0.1 dex), can be ob- 



tained down to SNR-35 for all metal-rich and intermediate- 
metallicity stars ([M/H]< -1 dex). Furthermore, accurate results 
(o"[m/h] <0.2 dex) that suffice to separate the different galactic 
components can be obtained down to SNR-20. Indeed, we ob- 
tained <T [M / H ] <0.12 dex, 0.18 dex and 0.23 dex for typical old 
thin disc, thick disc and halo stars, respectively. 

The application of this hybrid method was tested success- 
fully on stellar libraries of observed targets, validating it for ex- 
tended surveys such as Gaia. Our results show for the first time 
the expected accuracies that are possible from the spectra gath- 
ered by the RVS of Gaia at its low sampling. Furthermore, in ad- 
dition to deriving accurate parameters, the method is well suited 
for big datasets, because it is not very time-consuming compu- 
tationally - processing 2 10 4 spectra takes less than an hour on a 
current laptop. The method is already coded in Java and will be 
delivered to the Gaia Data Processing and Analysis Consortium 
(DPAC) as one of the possible nodes of the whole processing 
pipeline. 

We identified those regions of the H-R diagram where spe- 
cial care must be taken to avoid systematic biases. To illustrate 
this point, consider a survey that mainly targets giant stars. In 
this case, the derived line-of-sight distances, obtained by the pro- 
jection of the estimated stellar atmospheric parameters on a set 
of isochrones, can be seriously under-estimated because a giant 
star can be misclassified as a hot dwarf. Spurious effects can 
therefore be introduced when computing the galactocentric po- 
sitions and space velocities of the sample and therefore com- 
promise any conclusions. Nevertheless, to minimise the effect 
of this degeneracy, we can consider a synthetic grid composed 
only of the spectra of giant stars, and consequently avoid pop- 
ulating regions of the H-R diagram that should be left empty. 
Furthermore, if a variety of spectral types are observed, appro- 
priate photometric ( (B-V) for warm stars, (V-K) for cooler stars) 
measurements can also help to constrain the parameter space. 

The conclusi ons presented in this p aper will be used in a 
companion paper Kordopatis et al. I (l201ll) . where we will present 
the chemical and dynamical properties of roughly 700 stars, se- 
lected to probe the galactic thick disc, outside the solar neigh- 
bourhood. 
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Table 2. Relative errors of MATISSE at 70% of the error distribution. 
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0.81 


0.08 


0.08 


0.18 


0.29 


GII-IV, - l<[M/H]<-0.5 dex 


67 


84 


163 


307 


0.13 


0.19 


0.54 


1.12 


0.07 


0.11 


0.18 


0.39 


GII-IV, -2<[M/H]<-1 dex 


66 


120 


281 


407 


0.18 


0.30 


0.92 


1.52 


0.17 


0.18 


0.25 


0.43 


GII-IV, [M/H]<-2 dex 


131 


297 


528 


729 


0.33 


0.63 


1.85 


2.51 


0.18 


0.34 


0.56 


1.33 


FII-IV, all [M/H] 


80 


110 


127 


150 


0.11 


0.15 


0.17 


0.25 


0.07 


0.10 


0.17 


0.23 


KV, [M/H]>-0.5 dex 


56 


61 


90 


140 


0.09 


0.13 


0.28 


0.54 


0.08 


0.09 


0.15 


0.27 


KV, -l<[M/H]<-0.5 dex 


71 


76 


102 


181 


0.12 


0.17 


0.32 


0.82 


0.09 


0.09 


0.17 


0.32 


KV, -2<[M/H]<-1 dex 


88 


99 


138 


260 


0.17 


0.25 


0.52 


1.78 


0.15 


0.18 


0.24 


0.40 


KV, [M/H]<-2dex 


93 


114 


210 


554 


0.32 


0.36 


0.47 


1.50 


0.16 


0.16 


0.27 


0.58 


GV, >-0.5 dex 


55 


90 


254 


439 


0.08 


0.15 


0.42 


0.86 


0.09 


0.10 


0.23 


0.38 


GV, -l<[M/H]<-0.5 dex 


62 


195 


329 


587 


0.11 


0.33 


0.54 


1.62 


0.08 


0.13 


0.23 


0.39 


GV, -2<[M/H]<-1 dex 


110 


296 


502 


798 


0.17 


0.50 


0.80 


2.52 


0.17 


0.21 


0.33 


0.51 


Ljv, Livi/rij<-z ciex 


A T"> 
4 /Z 




ool 


1 1 

l loo 


n 1 1 


yj.yj 


Z.04 


J.JO 


U.zo 




U.oV 


1 1Q 


FV, >-0.5 dex 


57 


106 


330 


552 


0.10 


0.18 


0.41 


0.84 


0.10 


0.12 


0.26 


0.41 


FV, -l<[M/H]<-0.5 dex 


71 


165 


405 


730 


0.12 


0.27 


0.50 


1.14 


0.10 


0.15 


0.31 


0.46 


FV, -2<[M/H]<-1 dex 


137 


340 


575 


1075 


0.18 


0.43 


0.75 


1.88 


0.17 


0.23 


0.42 


0.68 


FV, [M/H]<-2dex 


1249 


1314 


1389 


1529 


1.28 


1.63 


2.86 


3.43 


1.92 


2.13 


2.23 


2.04 


Thin disc dwarfs 


52 


92 


228 


382 


0.08 


0.15 


0.35 


0.69 


0.09 


0.10 


0.21 


0.34 


Thick disc dwarfs 


67 


207 


364 


662 


0.11 


0.33 


0.53 


1.51 


0.10 


0.13 


0.25 


0.42 


Halo giants 


74 


116 


263 


432 


0.19 


0.34 


1.00 


1.72 


0.16 


0.18 


0.27 


0.45 



Notes. Because of the poor statistics on FII-IV stars, all the metallicities were considered to infer the published value. The three last lines 
represent the accuracies of the method for typical old thin disc dwarfs (log g >3.9, -0.5<[M/H]< -0.25 dex), thick disc dwarfs ((log g >3.9, 
-1.5<[M/H]< -0.5 dex) and halo giants (T eff <6000 K, log g <3.5, -2.5<[M/H]< -1.25 dex). 



Table 3. Relative errors of DEGAS at 70% of the error distribution. 







T e s 


(K) 




log g (dex) 


[M/H] (dex) 


SNR (pixel" 1 ) 


100 


50 


20 


10 


100 


50 


20 


10 


100 


50 


20 


10 


KII-IV, [M/H]>-0.5 dex 


91 


87 


107 


213 


0.16 


0.17 


0.22 


0.53 


0.08 


0.08 


0.09 


0.22 


KII-IV, -l<[M/H]<-0.5 dex 


76 


75 


106 


233 


0.19 


0.20 


0.30 


0.61 


0.08 


0.08 


0.14 


0.24 


KII-IV, -2<[M/H]<-1 dex 


81 


81 


125 


315 


0.19 


0.20 


0.49 


0.93 


0.17 


0.18 


0.21 


0.35 


KII-IV, [M/H]<-2 dex 


77 


78 


263 


399 


0.20 


0.24 


0.54 


0.98 


0.10 


0.16 


0.20 


0.43 


GII-IV [M/H]>-0.5 dex 


91 


91 


100 


274 


0.16 


0.18 


0.24 


0.60 


0.08 


0.08 


0.11 


0.24 


GII-IV -l<[M/H]<-0.5 dex 


78 


88 


171 


301 


0.19 


0.20 


0.38 


0.70 


0.08 


0.09 


0.16 


0.29 


GII-IV -2<[M/H]<-1 dex 


75 


84 


249 


468 


0.19 


0.23 


0.61 


1.04 


0.17 


0.17 


0.24 


0.46 


GII-IV [M/H]<-2 dex 


107 


182 


469 


695 


0.27 


0.43 


1.03 


0.85 


0.18 


0.22 


0.52 


0.66 


FII-IV all [M/H] 


75 


75 


75 


138 


0.12 


0.12 


0.13 


0.14 


0.12 


0.12 


0.20 


0.26 


KV, [M/H]>-0.5 dex 


77 


77 


88 


141 


0.18 


0.18 


0.23 


0.43 


0.09 


0.09 


0.10 


0.17 


KV, -l<[M/H]<-0.5 dex 


81 


80 


98 


160 


0.18 


0.18 


0.26 


0.46 


0.09 


0.09 


0.11 


0.22 


KV, -2<[M/H]<-1 dex 


84 


83 


123 


257 


0.18 


0.19 


0.30 


0.55 


0.17 


0.17 


0.20 


0.30 


KV, [M/H]<-2 dex 


91 


95 


204 


429 


0.13 


0.21 


0.27 


0.78 


0.11 


0.11 


0.23 


0.33 


GV [M/H]>-0.5 dex 


83 


84 


168 


310 


0.11 


0.13 


0.30 


0.49 


0.09 


0.09 


0.12 


0.24 


GV, -l<[M/H]<-0.5 dex 


83 


97 


216 


371 


0.11 


0.14 


0.33 


0.48 


0.09 


0.09 


0.16 


0.30 


GV -2<[M/H]<-1 dex 


99 


121 


320 


656 


0.13 


0.18 


0.40 


0.67 


0.19 


0.20 


0.29 


0.52 


GV [M/H]<-2 dex 


264 


365 


612 


943 


0.26 


0.34 


0.73 


1.09 


0.18 


0.26 


0.46 


0.78 


FV, [M/H]>-0.5 dex 


90 


93 


205 


332 


0.18 


0.18 


0.31 


0.47 


0.09 


0.09 


0.15 


0.27 


FV, -l<[M/H]<-0.5 dex 


93 


101 


232 


411 


0.18 


0.19 


0.36 


0.52 


0.08 


0.09 


0.17 


0.32 


FV, -2<[M/H]<-1 dex 


95 


108 


401 


817 


0.18 


0.20 


0.47 


0.80 


0.19 


0.20 


0.33 


0.65 


FV, [M/H]<-2 dex 


156 


264 


622 


1003 


0.21 


0.30 


0.67 


0.89 


0.18 


0.22 


0.56 


1.06 


Thin disc dwarfs 


84 


85 


142 


278 


0.12 


0.13 


0.26 


0.48 


0.09 


0.09 


0.11 


0.21 


Thick disc dwarfs 


89 


100 


223 


398 


0.13 


0.16 


0.34 


0.50 


0.10 


0.11 


0.19 


0.33 


Halo giants 


78 


85 


195 


441 


0.20 


0.23 


0.58 


0.97 


0.16 


0.17 


0.22 


0.42 



Notes, see Table[2]for a description of the line labels. 
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Table 4. Relative errors of the final adopted pipeline, at 70% of the error distribution. 







T e g 


(K) 




log g (dex) 


[M/H] (dex) 


SNR (pixer 1 ) 


100 


50 


20 


10 


100 


50 


20 


10 


100 


50 


20 


10 


KII-IV, [M/H]>-0.5 dex 


51 


49 


102 


142 


0.10 


0.14 


0.21 


0.41 


0.09 


0.09 


0.10 


0.15 


KII-IV, - l<[M/H]<-0.5 dex 


55 


65 


109 


225 


0.12 


0.17 


0.31 


0.60 


0.08 


0.09 


0.12 


0.20 


KII-IV, -2<[M/H]<-1 dex 


68 


77 


132 


294 


0.15 


0.23 


0.47 


0.85 


0.14 


0.17 


0.22 


0.34 


KII-IV, [M/H]<-2dex 


51 


92 


247 


246 


0.21 


0.47 


0.61 


1.01 


0.14 


0.16 


0.20 


0.15 


Gn-IV, [M/H]>-0.5 dex 


70 


69 


158 


253 


0.10 


0.15 


0.37 


0.67 


0.08 


0.09 


0.14 


0.21 


GII-IV, - l<[M/H]<-0.5 dex 


65 


74 


164 


309 


0.12 


0.19 


0.44 


0.65 


0.07 


0.11 


0.11 


0.25 


GII-IV, -2<[M/H]<-1 dex 


63 


94 


234 


357 


0.17 


0.25 


0.59 


0.84 


0.17 


0.18 


0.24 


0.40 


GII-IV, [M/H]<-2dex 


90 


214 


392 


386 


0.25 


0.48 


0.83 


0.72 


0.17 


0.27 


0.38 


0.43 


FII-IV all [M/H] 


69 


106 


71 


92 


0.14 


0.15 


0.12 


0.10 


0.07 


0.08 


0.15 


0.15 


KV, [M/H]>-0.5 dex 


59 


64 


87 


119 


0.09 


0.12 


0.19 


0.23 


0.08 


0.08 


0.09 


0.14 


KV, -l<[M/H]<-0.5 dex 


73 


79 


95 


158 


0.12 


0.16 


0.20 


0.26 


0.08 


0.09 


0.10 


0.18 


KV, -2<[M/H]<-1 dex 


84 


87 


120 


253 


0.14 


0.17 


0.21 


0.34 


0.15 


0.16 


0.19 


0.31 


KV, [M/H]<-2 dex 


92 


86 


177 


333 


0.17 


0.18 


0.14 


0.85 


0.10 


0.13 


0.19 


0.39 


GV, [M/H]>-0.5 dex 


57 


76 


160 


275 


0.08 


0.12 


0.27 


0.40 


0.09 


0.10 


0.14 


0.29 


GV, -l<[M/H]<-0.5dex 


64 


104 


190 


295 


0.10 


0.16 


0.28 


0.43 


0.08 


0.11 


0.14 


0.22 


GV, -2<[M/H]<-1 dex 


89 


136 


317 


614 


0.14 


0.18 


0.39 


0.70 


0.16 


0.19 


0.30 


0.52 


GV, [M/H]<-2 dex 


169 


328 


654 


756 


0.22 


0.46 


0.67 


0.95 


0.18 


0.26 


0.50 


0.61 


FV, [M/H]>-0.5 dex 


54 


83 


204 


361 


0.11 


0.16 


0.27 


0.43 


0.10 


0.12 


0.19 


0.34 


FV, -l<[M/H]<-0.5 dex 


70 


116 


197 


404 


0.12 


0.20 


0.27 


0.43 


0.09 


0.12 


0.15 


0.29 


FV, -2<[M/H]<-1 dex 


100 


174 


529 


945 


0.16 


0.27 


0.48 


0.85 


0.16 


0.20 


0.40 


0.79 


FV, [M/H]<-2dex 


185 


383 


741 


981 


0.27 


0.46 


0.73 


0.90 


0.27 


0.41 


0.63 


1.03 


Thin disc dwarfs 


55 


70 


147 


267 


0.08 


0.12 


0.24 


0.39 


0.09 


0.09 


0.12 


0.30 


Thick disc dwarfs 


67 


108 


188 


346 


0.11 


0.17 


0.29 


0.43 


0.09 


0.12 


0.18 


0.29 


Halo giants 


65 


94 


188 


335 


0.17 


0.28 


0.57 


0.86 


0.15 


0.17 


0.23 


0.38 



Notes, see Table[2]for a description of the line labels. 
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